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The invention relates to a parametric encoder and method for encoding an 
audio or speech signal into sinusoidal code data according to the preambles of claims 1 and 6, 
respectively. 

The invention further relates to a parametric decoder and method for re- 
constructing an approximation of said audio or speech signal from said sinusoidal code data 
according to the preambles of claims 1 1 and 12, respectively. 

Audio and speech signals are preferably encoded before being transmitted via 
a channel or stored on a storage medium in order to compress the data of said signals. Audio 
or speech signals are substantially represented by sinusoidal code data and consequently 
specific encoders are known in the art specialised for the encoding of these signals. Such a 
parametric encoder is e.g. known from E.B. George and MJ.T. Smith, "A new speech coding 
model based on a least-squares sinusoidal representation". In Proc. 1987 Int. Conf. Acoust. 
Speech Signal Process. (ICASSP87), pages 1641-1644, Dallas TX, 6-9 April 1987. IEEE, 
Picataway, NJ. The parametric encoder described there is illustrated in Fig. 5. According to 
Fig, 5 the parametric encoder 500 comprises a segmentation unit 510 for segmenting a 
received audio or speech signal s into at least one finite segment x(n). 

Said segment x(n) is input to a calculation unit 520. Said calculation unit 520 
calculates sinusoidal code data in the form of phase and amplitude data of a given extension 
x from the segment x(n) such that the extension x approximates the segment x(n) as good 
as possible for a given criterion, e.g. minimum of weighted squared error. For the cited 
parametric encoder the extension is given by 
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ic(«) = £^'(«)cos(0'(n)) 
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with and ^ are polynomial coefficients of the amplitude parameter A 1 and of the phase 
parameter <D' . 

The calculation unit 520 comprises a frequency estimation unit 522 for 
calculation the phase coefficients (j) l k from the received segment x(n) for example, for k = 1 

(thus <p[ ), by picking frequencies in the frequency spectrum of said segment x(n). These 

phase coefficients <fi l k represent the phase part of said sinusoidal code data are on one hand 
output to a multiplexer 530 and are on the other hand input into a pattern generation unit 524. 
Said pattern generation unit serves for calculating the phase parameter (ri) according to 
equation (3). 

The pattern generation unit 524 further generates a plurality of JxL 
components py of the extension x(n) according to 

p y (n) = cosCOXn)), with i = 1-Lj = O-(J-l) 

The plurality of JxL components py is input to an amplitude estimation unit 526 which 
determines the optimal amplitude data a' from said received components as well as from 
the received segment x(n) output from the segmentation unit 510. 

The phase coefficients <j>[ and the amplitudes a 1 form the sinusoidal code 

data which represents the extension x(n) as an approximation of the segment x(n). These 

sinusoidal code data are multiplexed by the multiplexer 530 in order to form a data stream 
which may be stored on a recording medium or transmitted via a channel. 

The extension xiri) as described by equation 1 and as known from the 
described parametric encoder 500 provides a proper approximation for an individual 
segments x(n) of the audio or speech signal. However, the calculation of the sinusoidal code 
data is rather complicated. 

Starting from that prior art it is an object of the invention to improve a known 
parametric encoder and method for encoding an audio or speech signal into sinusoidal code 
data and to improve a known parametric decoder and method for re-constructing an 
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approximation of said audio or speech signal from said sinusoidal code data after 
transmission or restoration such that the calculation of said sinusoidal code data can be 
carried out in a simpler and cheaper way. 

This object is solved by the subject matter of claim 1 . More specifically, this 

object is solved by adapting the calculation unit to calculate the sinusoidal code data O^d^ 
and e l } for the following extension x : 

m = tlfc'jfj («)cos(0' (»)) + e)f ] (»)sin(0' («))] 
with 

©'(") = ^0 l k n k 
wherein: 



i : represents a component of the extension X (n); 

j,k : represent parameters; 

n : represents a discrete time parameter; 

9 l k : represents the phase coefficient value as one of said sinusoidal 

code data 

fj : represents the jth instance out of the set of J linearly 

independent functions; 
© ! : is a phase; and 

dj , e l j : represent the linearly involved amplitude values of the 

components representing the amplitude parts of said sinusoidal code 
data. 



Advantageously, the optimisation problem occurring when trying to define the 
sinusoidal data such that the claimed extension x accurately describes a specific segment 
x(n) is easy to solve. The easy calculation results from the fact that except the phase 
coefficients 0 l k the amplitude data d l and e l } are linearly involved within the claimed 

extension x . Note that there does not appear a zeroth order phase coefficient in 0' , whereas 
such component exists in O' in the form of $ . 
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Further, advantageously the claimed extension x provides more degrees of 
freedom for defining the sinusoidal code data with the result, that the claimed extension X is 
broader than the extensions known in the art and provides a more accurate approximation of 
an individual segment x(n). 

According to a first embodiment of the invention the linearly independent 
function fj(n) is set to fj(n) = n 1 . In that way the claimed extension x is restricted to a 
polynomial extension. 

Further advantageous embodiments of the claimed parametric encoder and in 
particular of the claimed calculation unit are subject matter of the dependent encoder claims. 

The above identified object is further solved by a method for encoding an 
audio or speech signal as claimed in claim 6. The advantages and embodiments of the said 
method correspond to the advantages and embodiments as explained above for the parametric 
encoder. 

The above identified object is further solved by a parametric decoder for re- 
constructing an approximation X of an audio or speech signal from transmitted or restored 
code data according to claim 11. More specifically, the object is solved by adapting a known 
synthesiser to re-construct said segments x from said sinusoidal code data <p l k ,d] and 
e l j according to the following formula: 



20 x(n) = t2[^/ 7 («)cos(0'(»)) + e;/»sin(0'(«))] 



with 



Jc=\ 



wherein: 
i 

25 j,k 
n 

fj 



30 & 



represents a component of the extension x (n); 

represent parameters; 

represents a discrete time parameter; 

represents the jth instance out of the set of J linearly 

independent functions; 

represents the phase coefficient as one of said sinusoidal data 
is a phase parameter; and 
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d l , e'j : represent the linearly involved amplitude values of the 

components representing parts of said sinusoidal data. 
Advantageously, the calculation of the claimed extension X is easier than the 
calculation of the extensions known in the art. This is due to the linear involvement of the 
5 amplitude data d l and e l } within said extension and the omission of the zeroth-order phase 
coefficient. 

Due to the easy calculation of the extension x the reconstruction of the 
original audio or speech signal s in the form of its approximation x can be realised cheaper 
and quicker. 

1 0 The above identified object is further solved by the decoding method as 

claimed by claim 12. The advantages of said method correspond to the advantages mentioned 
above by referring to the parametric decoder. 

Five figures are accompanying the description, wherein 

Fig. 1 shows a first embodiment of the parametric encoder according to the 



1 5 invention; 



invention; 



Fig. 2 shows a second embodiment of the parametric encoder according to the 



Fig. 3 shows a flow chart illustrating the operation of the second embodiment 
of the parametric encoder according to the invention; 
20 Fig. 4 shows a parametric decoder according to an embodiment of the 

invention; and 

Fig. 5 shows a parametric encoder as known in the art. 



25 Before describing the preferred embodiments of the invention some basic 

explanations about the subject matter of the invention are given. 

The invention proposes an extension x(n) for approximating a segment x(n) 

of a sinusoidal audio or speech signal s. Said extension x(n) is represented by phase and 
amplitude data, hereinafter also referred to as sinusoidal code data. The sinusoidal code data 
30 is defined such that the extension x(n) approximates the segment x(n) of the audio or 

speech signal as good as possible for a given criterion, e.g. minimisation of the squared 
weighted error. Expressed in other words, the sinusoidal code data has to be defined by 
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solving an optimisation problem. After the sinusoidal code data has been defined for 
optimally approximating a particular segment x(n) it might be stored on a storage medium or 
transmitted via a channel as code data representing said segment x(n) and thus also 
representing said audio or speech signal s. Preferably, before being stored or transmitted the 
sinusoidal code data might be encoded and/or cleaned in the way that irrelevant or redundant 
data is removed from it. 

The generation of said sinusoidal code data according to a first embodiment is 
now explained by referring to Fig. 1 . 

Fig. 1 shows a first preferred embodiment of a parametric encoder 100 for 
generating said sinusoidal code data representing an input audio or speech signal s. The 
received signal s is input to a segmentation unit 1 10 for segmenting said signal s into at least 
one segment x(n). Said segment x(n) is input into a calculation unit 120 for generating said 
sinusoidal code data such that the extension x with 



m = t2k/,(*)c°s(0'(")) + </>)sin(0'()O)] 

/=1 7=0 



(4) 



with 



K 



(5) 



and wherein: 



ij,k 



represent parameters; 

represents a discrete time parameter; 



n 



represents the phase coefficient as one of said sinusoidal data 



represents the jth instance out of the set of J linearly 
independent functions; 
is a phase; and 




represent the linearly involved amplitude values of the 



components representing parts of said sinusoidal data 
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approximates the segment x(n) input to said calculation unit 120 as good as possible for a 
given criterion, e.g. minimisation of weighted squared error. The sinusoidal code data to be 
determined by said calculation unit 120 is the phase 9[ and the amplitude data d l and e l } . 
The term Q of equation (4) with 

5 Ci= g[^/ y ( W )cos(0'( W )) + e ;//«)sin(0 I («))] (6) 

7=0 



is hereinafter referred to as the i'th component of the extension x with i = 1-L. 

The calculation unit 120 comprises a frequency estimation unit 122 for 
determining a plurality of LxK phase coefficients 0[ with k = 1-K for all components Ci 
10 with i = 1-L of the extension x(n) according to formula (5) representing the individually 
received segment x(n). Said plurality of LxK frequencies 0 l k is input to a pattern generating 

unit 124 for calculating a plurality of L frequency parameters & (n) with i = 1-L according 
to formula (5). Said pattern generating unit 124 is further adapted for generating a plurality of 
JxL pairs of patterns p\*p\ , for the components Ci with i = 1-L according to: 



p;-fj(n)cos(0 ? («));and 
p>=fj(n)sin(0^)) 



fori - 1-L andj =0-(J-l). 
20 Said plurality of pairs of patterns p\ } , p* is - together with the segment x(n)- 

input to an amplitude estimation unit 126 for determining a plurality of JxL amplitude data 
dj for all received patterns p\ } and a plurality of JxL amplitude data e l } for all the received 

patterns p* of all components Ci of the extension x(n) . 

The calculation unit 120 and in particular the frequency estimation unit 122 
25 and the amplitude estimation unit 126 are adapted such that the sinusoidal data comprising 
the phase data G[ and the amplitude data dj and e l } is determined and optimised such that 

the criterion "minimisation of weighted squared error E between the segment x(n) and the 
extension x(n) " is (approximately) fulfilled. 
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The parametric encoder 100 may further comprise a multiplexer 130 for 
transforming the plurality of LxK phase coefficients 0 l k as output by said frequency 
estimation unit 122 and said plurality of JxL amplitude data d l . and e l } as output by said 

amplitude estimation unit 126 into a data stream to be stored on a storage medium or to be 
transmitted via a channel. 

Fig. 2 shows a second embodiment of the parametric encoder 100'. Like the 
parametric encoder 100 the parametric encoder 100' also serves for generating said 
sinusoidal code data from the input audio or speech signal s. The operation of its 
segmentation unit 110' corresponds to the operation of the segmentation unit 110. 
Consequently, the segmentation unit 110' generates segments x(n) of the received signal s at 
its output. Said segments x(n) are input to a calculation unit 120'. In difference to the first 
embodiment of the calculation unit 120 the calculation unit 120' does not calculate the 
plurality of sinusoidal code data simultaneously for all components of a segment x (n) but 
generates this sinusoidal code data sequentially for each component Ci with i = 1-L of the 
extension x . This way of calculation is generally known in the art as analysis-by-synthesis or 
as matching pursuit algorithm. However, in the prior art an application of said method is only 
known for extensions different from the claimed extension x according to formula (4). 

In the following the operation of said second embodiment of the calculation 
unit 120' is explained by referring to Figs. 2 and 3. More specifically, the calculation of the 
sinusoidal code data of the extension x according to equation (4) is described such that the 
weighted squared error between a segment output by the segmentation unit 110' and its 
extension x according to equation (4) is (approximately) minimised. 

In a first cycle i = 1 the sinusoidal code data of a first component Ci with i = 1 
of the extension x are calculated (method step a) in Fig. 3). 

For achieving this, the output of segmentation unit 1 10' x(n) is set to: sm = 
x(n) (see method step b)). 

In said first cycle, said output of the segmentation unit 1 10' is input to a 
frequency estimation unit 122' for determining a plurality of K phase coefficients 9 l k with k 
= 1-K from the input value Si_i (see method step c)). Said phase coefficients 0[ represent the 
phases of the searched sinusoidal code data and are thus output from the calculation unit. 
Moreover, said phase coefficients 6[ are input to a pattern generating unit 124' for 

calculating the phase & with i = 1 for the first component CI according to equation (5) (see 
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method step d)). Said pattern generating unit 124' further serves for generating a plurality of 
2xJ patterns with j = O-(J-l) for the component Ci with: 

pl=fj(n)cos(G\n));and 

for i = 1 (see method step e)). These generated patterns p\ 9 p^ are - together with the 
parameter £i_i - input to an amplitude estimation unit 126'. Said amplitude estimation unit 
126' serves for determining a plurality of J amplitudes d l } for said patterns p] and of J 

amplitudes for said patterns p* for the component Ci with i = 1 from the received input 
data (see method step f)). Said calculated amplitudes d' and e) form the amplitude part of 
the sinusoidal data representing the extension x of the segment x(n) and are thus output from 
that calculation unit 120' in order to be - together with said phase data 6[ merged into a data 
stream representing said first component Ci with i = 1. Moreover, said amplitude data d' 
and e l j are - together with their respective patterns p\ } and p 2 input into a synthesiser 128' 
for calculating the component Ci with i = 1 according to 

c i = £ l d jfj («)<*>*(©' («)) + e )f (n) sin(0< («))] 

7=0 

(see method step g)). 

Said component Ci is input into a substracting unit 129' for being substracted 
from the value sm being input to said frequency estimation unit 122'. The difference 
occuring at the output of said substracting unit 129' is referred to as with i = 1 (see method 
step h)). 

Now the first cycle for calculating the first component CI and its sinusoidal 
code data 6[ , d l } and e l y for the extension x has been finished. Subsequently, the parameter 

i is compared with the total number L of components Ci of the segment x (see method step 
i)). If i < L method steps c) to i) are repeated for i - In these cases the output from the 
segmentation unit 1 10' for i > 1 is disconnected from the input of the frequency estimation 
unit 1 22'; instead, the input of said frequency estimation unit 122' is connected to the output 
of said substracting unit 129' for receiving the differences Si. However, if i > L the 
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sinusoidal code data of all L components of the extension x have been calculated and thus 
the calculation process carried out by the calculation unit 120' has been finished for a 
particular segment x . Subsequently, the whole procedure may be repeated for a subsequent 
segment of the input audio or speech signal. 

Fig. 4 shows a parametric decoder 400 for reconstructing an approximation s 
of an audio or speech signal s from received input data. These received input data correspond 
to data of a data stream after being transmitted or restored from a storage medium. 

The parametric decoder 400 comprises a selecting unit 420 for selecting 
sinusoidal code data 0[ , d l 3 and e\ representing segments x of the approximation s of the 

audio and/or speech signal s from said received input data. The parametric decoder 400 
further comprises a synthesiser 440 for reconstructing said segments x from said received 
sinusoidal code data and a joining unit 460 for re-constructing the approximation s by 
linking the re-constructed segment x . 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. The 
word 'comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 



